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Abstract 



We have investigated the binding interaction between the bacteriophage A repressor CI 
and its target DNA using total internal reflection fluorescence microscopy. Large, step- 
wise changes in the intensity of the red fluorescent protein fused to CI were observed 
as it associated and dissociated from individually labeled single molecule DNA targets. 
The stochastic association and dissociation were characterized by Poisson statistics. Dark 
and bright intervals were measured for thousands of individual events. The exponential 
distribution of the intervals allowed direct determination of the association and dissociation 
rate constants, kg, and fed respectively. We resolved in detail how k^, and k^ varied as a 
function of 3 control parameters, the DNA length L, the CI dimer concentration, and 
the binding affinity. Our results show that although interaction with non-operator DNA 
sequences are observable, CI binding to the operator site is not dependent on the length of 
flanking non-operator DNA. 
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Introduction 

The CI protein, the product of the cl gene of bacteriophage A, has long served as an impor- 
tant model system for understanding protein-DNA interactions and gene regulation yj, y)- 
In A phage, CI monomers assemble into active dimers, Cl2's, that bind to three DNA sites 
(OrI, Or2 and Or3) on the right operator Or, which determines genetic switching between 
the lysogenic and lytic lifestyles. Adjacent dimers interact cooperatively, and these higher 
order complexes can interact with similar complexes bound to 3 related tandem operators 
OlI, Ol2 and Ol3 ~2.4 kilo basepairs (bp) away ([H). Understanding the dynamics of 
binding is important for unraveling the relevant physical mechanism of genetic control, as 
well as broader problems such as macro-molecular recognition. 

The interactions between CI and its targets may be the best studied repressor-operator 
system in molecular biology (0, 0, [i], 0, Q) . However, we know nothing about how individual 
stochastic binding events sum to yield the ensemble averages revealed by nitrocellulose 
filter binding assays (jsl), DNase footprinting electrophoretic mobility shifts (jiol ). and 
isothermal titration calorimetry (0). In recent years, several single- molecule techniques 



have been introduced to study protein-DNA interactions (11 ll Il2l . Il3l . Il4l ). The tethered 



particle motion technique fjl5l ) provides dynamic information about single-molecule length 



at 100 nm scale. Fluorescence-aided single molecule sorting (jl6l. llTl) provides the fraction of 



molecules bound in the reaction at the single molecule level. A very promising technique is 



single-molecule fluorescence microscopy (SMFM) (jl8l ). which is capable of imaging the same 
molecule for long periods. In order to study target specificity and binding on a molecule- 
by-molecule basis, we used the SMFM method, tethering the target DNA individually 
labeled with single fiuorophore to a surface, and labeling the CI protein by fusion to a red 



fluorescent protein, tdimer RFP (jl9l ). 

As noted above, because each DNA molecule can interact with several CI2 molecules, 
this interaction may have multiple outcomes, with variable numbers of CI2 molecules inter- 
acting with their targets and each other - the regulatory circuit is complex. To investigate 
these interactions at the most basic level, we restricted our studies to the binding of CI2 
to a single operator sequence, OrI, thus avoiding the complexities of these higher order 
interactions. 

In experiments of these kinds, large uncertainties often arise from difficulties in com- 
paring substrate and protein concentrations [CI2] from sample to sample, and experiment 
to experiment. We addressed this problem by arranging different DNA populations in an 
array geometry on the same substrate and immersing the DNA in the same protein solu- 
tions. This allowed one variable - e.g. the DNA length L - to be varied while holding [CI2] 
fixed. With these DNA arrays, we observed reproducible variations of the dissociation and 
association rate constants with respect to both L and [CI2]. 

Results from DNA molecules with and without OrI sites, with OrI fianked by a range of 
different nonspecific DNA lengths, and with CI repressors from both wild-type and a tight- 
binder mutant were compared. Large step-wise changes in fluorescence intensity enabled 
us to resolve clearly the association and dissociation of single CI2 molecules from their 
targets. This proved valuable in the stochastic analysis of the intensity vs. time traces. 
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We measured dissociation as well as association rate constants kd and fca- The values 
we measured agree qualitatively, although not quantitatively, with those obtained in bulk 
assays, which provide ensemble averages. We found that operator specific rate constants 
had very little dependence on DNA length. With increasing affinity of a mutant repressor 
for its target, the dissociation rate constant becomes smaller while the association rate 
constant increases. We also found that although CI binding to nonspecific DNA sequences 
is easily observed, nonspecific DNA fianking the OrI sequence has no infiuence on specific 
rate constants, suggesting that ID searching along nonspecific sequences is not an important 
aspect of repressor targeting over the DNA length L studied here. 

Materials and Methods 
DNA and CI 

The OrI sequence was inserted into the pGEM®-7Zf(+) vector (Promega Corporation, 
Madison WI) at the Apal site. The local sequence is 5'-...GGG CCC TTA TCA CCG 
CCA GAC GTA GGG CCC. ..-3', where GGG CCC is the Apal recognition site. The 
construct was verified by sequencing. The polymerase chain reaction (PGR) was used to 
amplify the OrI sequence together with the fianking sequences using the Expand HiFi 
PGR System (Roche Diagnostics Corporation, Indianapolis, IN). The PGR product was 
purified with Qiagen QIA PGR Kit (Qiagen Inc. Valencia, GA). As shown in Fig. [TJ\, a 
common left primer (5'-CAG TCA ATT TGT AAT AGG ACT GAG TAT AGG G-3') was 
used with different right primers (Table [1]) to create DNA of varying lengths. The common 
left primer has a biotin group attached to the 5' end to immobilize the DNA. A green 
ffuorescent Alexa Fluor® 488 was attached at the base to the ninth T from the 5' end of 
the left primer. All primers, including the modifications were ordered from Integrated DNA 
Technologies, Coralville, lA. In particular, the internal Alexa Fluor 488 dT modification 
is available through off-catalog order. We call the DNA with the OrI sequence operator 
(op) DNA. We also made DNA molecules without the operator sequence using the original 
vector as PGR template. We call them non-operator (nop) DNA. DNA lengths were verified 
using electrophoresis. 

All CI protein preparations used in our single molecule investigations were fusions with 
tdimer (a red fiuorescent protein, tdimer(2)12 in (jiol)). HN-tagged WT CI constructs, Lys- 
34 CI, Cys-88 CI and their respective tdimer fusions were cloned into the PROTet"""^ 6xHN 
cloning vector by standard methods (j2ol ). All plasmid inserts were verified by sequencing. 
CI function was established by demonstrating that strains carrying each construct in the 
PROTet^^ vector (CLONTECH Laboratories, Inc. Mountain View, GA) displayed the 
expected sensitivity to Xvir and z^^^, and resistance to A PaPa superinfection, hallmarks 
of functional CI. CI and the tdimer: CI fusions were then purified by Talon™ column 
chromatography (CLONTECH Laboratories, Inc. Mountain View, CA). The molarities 
of purified fractions were established by quantitative densitometry of each preparation by 
comparison to known quantities of bovine serum albumin run in adjacent lanes of SDS- 
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polyacrylamide gels. Molecules of tdimer:Cys-88 CI, whose threonine residue at position 
88 had been replaced by a cysteine, were shown to be dimers by electrophoresis in non 
reducing SDS-polyacrylamide gels, as expected (1211 ) . Purified fractions were tested for in 
vitro activity by standard gel-shift assays using a 200 bp DNA fragment containing the A 
Or operator consensus sequence and non-specific competitor DNA as a control for sequence 
specificity fj22l ). The tdimer:CI fusions were as active or more so than respective CI proteins 
on a molar basis. 

The tdimer:CI preparations always contained a contaminating protein with a molecular 
mass of ~25 kD. Because this is close to the mass of native CI, which would act as a 
nonfiuorescent competitor in our experimental system, we cut out both the tdimer:CI band 
and the ~25 kD band from a denaturing polyacrylamide gel and identified them by mass 
spectrometry. The fusion product exhibited the expected peptides from the tdimer and CI. 
There was no CI signal in the contaminating band. The identified peptides were entirely 
from the tdimer. We suspect that this contaminant arises either by proteolysis (unlikely) 
or premature chain termination of the tdimer as it is synthesized. 



Preparation of DNA arrays in channels 

DNA molecules were immobilized on coverslip surfaces. Glass surface treatments fol- 
low (j23l ). Coverslips (22 mm x 40 mm) were cleaned by first soaking in 5% Contrad® 
70 (Decon Laboratories, Inc. King of Prussia, PA) for 2 hr, followed by rinsing and im- 
mersing in deionized water for 1 hr. They were then rinsed with and immersed in acetone 
for 5 min., followed by 5 min. in Vectabond (Vektor Laboratories, Burlingame, CA) 
solution (2% v/v in acetone). They were then washed with acetone and deionized water 
and blow-dried with clean compressed air. The coverslip surfaces were exposed to polyethy- 
lene glycol (PEG) coating solution, which was prepared as 25% (w/v) mPEG-SPA (Nektar 
Therapeutics, Huntsville, AL) and 0.25% (w/v) biotin-PEG-NHS (Nektar Therapeutics, 
Huntsville, AL) in 50 mM NaHCOs (pH 8.5). After incubating in a humid chamber for 3 
or more hr, the coverslip surfaces were rinsed with deionized water and kept in deionized 
water for an additional 2 hr. After blow drying, the coverslips were exposed to 0.1 mg/ml 
streptavidin solution for 15 min, rinsed with deionized water and blow-dried again. 

Solutions of biotin labeled DNA were pipetted onto a coverslip to form a typically 4x6 
array of patches (Fig.[Tj3). After a 15-min. incubation in a humid chamber, DNA molecules 
were immobilized by biotin-streptavidin interaction. To avoid mixing between different 
DNA populations, excess solution on the coverslip was blotted dry with bibulous paper. 
Binding buffer (10 mM PIPES, 50 mM KCl, 5% DMSO, 0.1 mM EDTA) was pipetted 
to the same spots and blotted dry repeatedly to remove unbound DNA. We divided the 
array into 4 channels with walls of vacuum grease. The channels were capped with a second 
coverslip coated with PEG and filled with tdimer: CI solution in binding buffer before sealed 
with beeswax. 

Protein absorption on the glass surfaces was minimized by the PEG coating. Non- 
specific absorption could change the actual protein concentration at low protein concentra- 
tion. We observed that protein absorption was at most one molecule per /im^. The liquid 
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layer was about 200 fim thick. At the lowest CI2 concentration of 70 pM, we estimate that 
non-specific binding could decrease the concentration by at most 20%, which would lower 
the measured association rate constants by only a small amount. 

To minimize tdimer:CI denaturation, protein solutions filled into the channels were 
freshly diluted ~1000-fold into binding buffer from a small amount of the nM stock solution 
just prior to each experiment. Concentrations of CI2 were calculated assuming a monomer- 



dimer dissociation constant of 20 nM (1241 ). By following this protocol, each patch of DNA in 
the same channel in Fig. [T]B was exposed to the same [CI2]. From one channel to the next, 
[CI2] increased by a factor of ~3. This allowed us to reduce significantly problems associated 
with differences in surface preparation and protein concentration. Hence, the 24 patches 
with varying L and [CI2] were investigated on the same substrate in one experimental 
run. The patches of DNA molecules were incubated with CI solution for ~60 min before 
acquiring fluorescence microscopy images. 

Time-lapse series were captured for all DNA lengths immersed in a selected CI con- 
centration for each prepared sample. A CI concentration that is too low resulted in very 
few binding events, while a concentration that is too high resulted in multiple CI2 bind- 
ing to a single DNA. The CI concentration that maximized single-Cl2 DNA interactions 
was selected. An experimental run was typically flve to six hours for each prepared array. 
During each run six 40- or 60-min time-lapse image series were acquired. The CI protein 
degradation was insigniflcant as judged by fluorescence intensity during the experiment. 

Single Molecule Fluorescence Microscopy 

DNA was immobilized to the coverslip surface and immersed in CI solution. Snapshots 
and time-lapse images were captured at room temperature by total internal reflection flu- 
orescence (TIRF) microscopy (Fig. [T]C) with a Nikon TE2000-U inverted microscope. Au- 
tomated image acquisition was made possible with HF204 emission fllter wheel and HI 12 
focus drive (Prior Scientiflc Inc. Rockland, MA). The sample was illuminated with the 
argon 488 nm laser line incident near the critical angle. 

The evanescent wave depth S equals - — ^=f=4 — where A is the light wavelength in 
vacuum, 488 nm, n is the refractive index of glass, 1.52, 9c is the critical angle from glass to 
water, 61.4°. The excitation intensity at the glass-water interface is also a monotonically 



decreasing function of the incident angle that can be theoretically calculated (l25l ). We 
calibrated the incident angle by measuring the intensities of 20 nm fluorescent spheres 
immobilized on a glass surface. We estimated that the evanescent wave depth was 300 nm 
or deeper in our experiments. Thus the entire length of the DNA was illuminated. We also 
note that the longer DNA's, especially those longer than the 200 bp persistence length, 
describe a volume that is on average closer to the surface, and hence on average the bound 
CI molecules were in the evanescent fleld. 

Both Alexa Fluor 488 and tdimer were excited at 488 nm. The emission of Alexa Fluor 
488 peaks around 517 nm (green), while the emission of tdimer peaks around 579 nm 
(red). Dichroic mirror, green and red emission fllters were used to separate excitation light 
and fluorescence from different molecules (Z488rdc, HQ525/50m, HQ645/75m, Chroma 
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Technology, Rochingham, VT). Images were recorded with an EM-CCD camera (DV877- 
DCS-BV, Andor Technology, South Windsor, CT). To minimize fluorophore bleaching, the 
laser power density was kept at ~19 W/cm^ over an illuminated area of ~ 60 x 40 /im^. 

For still images, a green frame was acquired immediately followed by the acquisition 
of a red frame, each with 1 s exposure time. For time-lapse images, a single green frame 
was only taken once at the beginning of each time-lapse run with 1 s exposure time, since 
the DNA molecules were immobilized. Red time- lapse images were acquired with 0.1 s 
exposure time every 10 or 15 s. The sample is only illuminated while a picture is being 
taken to minimize tdimer bleaching and therefore to maximize event recording. In our 
experiment, the time it takes for a bound CI2 to dissociate from DNA was on the order of 
minutes. We chose [CI2] so that it took about the same amount of time for an unbound 
CI2 to associate with DNA. In the supplemental movie, the DNA molecules (marked with 
blue circles) could be seen dynamically interacting with CI2. 



Image analysis 

First, the background fluorescence was subtracted from every image. The background 
fluorescence mainly comes from the buffer autofiuorescence, which is proportional to the 
illumination intensity. The laser illumination has a two dimensional (2D) Gaussian profile. 
The image background was determined by fitting the image, most of whose pixels were 
background pixels, to a 2D Gaussian function. After background subtraction, the image 
is also normalized by the same 2D Gaussian profile to compensate for the nonuniform 
illumination. Only the center of the image, the region with the highest signal to noise 
ratio, was used for further analysis. 

In each image, fluorescent dye or protein molecules appear as diffraction-limited bright 
spots. These spots were identified by thresholding. The intensity of a bright spot was 
calculated as the average intensity of the 3 by 3 pixels around the local maxima above 
background. In the still images, the red intensity was normalized by the mean intensity of 



tdimer: Cys-88 CI molecules, which form covalent dimers independent of concentration (1211 ). 
as dimers are the smallest active units interacting with DNA. From now on, we will mention 
tdimer: CI as CI in this paper, since all our CI proteins are tdimer fusions. 

Cross-correlation was calculated between the green DNA image and the red CI image. 
Colocalizeation of CI2 and DNA resulted in strong cross-correlation peak, which was also 
used to align the green and red images. By overlaying the aligned green image over the 
red, we easily identified the binding events of CI2 to DNA as the colocalization of green 
and red spots, which acquired a yellow hue. 

Time-lapse images were also aligned to correct horizontal drift. For time- lapse images, it 
was often observed that at the locations of green DNA molecules, the red intensity showed 
step-wise switches between zero and non-zero levels. We identify these changes with the 
association and dissociation of red fluorescent CI2 to and from DNA. 

A semi-automated computer program was developed to divide the red intensity vs. time 
curve {I{t)) into a series of relative flat segments connected by sharp transitions. Before 
analyzing the raw trace of red intensity I{t), we measured two intensities Jq and Ji in several 
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manually selected traces. Iq is the average quiescent intensity when no CI2 proteins are 
bound to the DNA. Ii is the average intensity when only one CI2 is bound. The values of 
/q and Ii are reproducible from run to run and are well defined throughout the experiment. 
Assuming the intensity is linearly scaled with number of CI2 bounded, the average intensity 
of more than one CI2 bound to the DNA can be estimated. Given the raw trace of the 
intensity I{t) vs. t, the program achieves automatic segmentation in two steps. With the 
average intensities /.„ {n = 0,1,2...) known, the program readily identifies the portions 
of I(t) belonging to no CI2, one CI2 and multiple CI2 states. However, at this stage, the 
actual transition time between these states may only be estimated roughly. 

To fix the transition time accurately, we employ a minimum-fluctuation method. At 
such low fluorescence level of a few CI2 molecules, the recorded intensity fluctuation is 
dominated by the dark current of camera. The intensity root-mean-square (rms) amplitude 
(To is constant regardless of the CI2 number, as long as the CI2 number remains small and 
constant. In a raw trace I{t), when a transition occurs (CI2 number changes), the rms 
amplitude increases significantly. We consider I{t) in a time interval [^1,^2]- If the rms 
amplitude a within [^1,^2] is less than 2ao, this interval is treated as a continuation of the 
previous state. However, if cr > 2ao, the program recognizes that a transition has occurred. 
It then picks an arbitrary point within [^1,^2]? and calculates the rms amplitude ai within 
[^1,^3] and (T2 within [^3,^2]- By minimizing the quantity ^^^'^^ with respect to 

^3, the program then determines accurately the actual transition time. In addition we also 
recognize small clusters of I(t) data points with significantly different mean values from 
the surrounding data points as very short segments. 

Because low signal to noise level can confuse the program, the segmented curves were 
visually checked and manually corrected if necessary. Only segments that begin and end 
within the complete trace of I{t) were used for our histograms. 

Gel-shift assay 

The same fluorescent DNA and CI in single-molecule experiments were used for the gel- 
shift assays. DNA and CI were mixed in binding buffer supplemented with 100 yUg/ml 
bovine serum albumin (BSA). The binding buffer is also the buffer used in the single- 
molecule measurements. The concentration of the DNA was 20 nM in every reaction 
while the concentrations of CI2 varied (either 5, 36.5 or 452 nM). The reaction mixtures 
were incubated at room-temperature. After about an hour, 20 fil reaction mixtures were 
electrophoresed in Bio-Rad 4-20% Gradient TBE gels (Bio-Rad, Hercules, CA) in a 4°C 
cold room. The gels were stained with ethidium bromide for DNA. The gel images were 
taken under UV illumination. 

Stochastic series 

To analyze how the intensity time trace I{t) is informed by the transition rates and to 
define some terms, we describe transitions between states -DP„ (defined as a DNA bound 
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by n Cl2's) as the time-dependent stochastic series 

An Ai Ari— 1 An , . 

DPo^DP,^DP2--- ^ DP^ ^ ■■■ (1) 

Ml M2 A'n Mn + 1 

where A„ and /i„ are the forward and backward transition rates governing the time-evolution 
of the probabihty of finding a complex DPn at time t. 

A powerful feature of dynamic SMFM is that it allows a single transition in the series 
Eq. [T]to be studied in isolation. In general, the time trace I{t) at a colocalized site shows 
both single- (Fig. [3JA.) and multiple-step transitions (Fig. |3j3) relative to the background 
intensity. We used a semi-automated computer program that efficiently distinguishes large, 
step-wise transitions from small-amplitude fluctuations. 

By collecting the set of traces exhibiting single-step transitions, we accumulated statis- 
tics on the transition DPq ^ DPi. The "up" and "down" transitions separates the bright 
durations tbright and dark durations tdark? whose probability distribution can be described 
by P(tdark) = Aoe-^o*dark and P(tbright) = fiie-^''^+^^^^''-^'^, respectively (Fig. [3lC,D). From 
the state DPi, the molecule complex goes to DPo with rate constant /xi. DPi also goes to 
DP2 with rate constant Ai. During the same period of time, the number of DPi to DPq 
transitions was counted as while the number of DPi to DP2 transitions was counted as 
We then have = From the individual traces of I{t) at each DNA bound to CI2 
(200 ~ 300 in the fleld of view), we accumulated hundreds of measurements of tdark and 
^bright in a single time- lapse data set. Assuming Poisson processes, Aq and /ii -|- Ai was ob- 
tained by fltting the distributions of tdark and tbright to exponential distributions according 



to the unbiased estimation in (1261 ). We obtain /ii by multiplying the second exponential 
constant with 7Vji+^- Hence the histograms of tdark and tbright in the flltered set allow Aq 
and fii to be measured directly (Fig. [3p,D). 

If the transitions to DP2, DP^, ■ ■ ■ are negligible, the association and dissociation rate 
constant /Ca and k^, of the reaction 

DPo + Ch^DPi (2) 
are just fca = Ao/[Cl2], = fii, with the dissociation constant Kr, = kd/ks,. 



Results 

Snapshots and time-lapse images of CI interacting with DNA 

In a typical snapshot with the red channel image overlayed on the green channel image, 
there are many bright spots (Fig. [Tp). The color of bright spots fall into three easily 
separable classes: green, red and yellow, identifled as DNA, CI and DNA-CI, respectively. 

The TIRF technique allows us to capture simultaneously the transitions in each of 
the DNA molecules within the fleld of view. When observing single fluorophore-labeled 
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molecules, the blinking and bleaching of fiuorophores might interfere with the measure- 
ments. 

Molecule bleaching was moderate in our experiments. The average intensity of all 
fluorescent molecules in the field of view decreased by 10 to 30% after 100 seconds of 
continuous illumination (Fig. [2]A.). The intensities of single Alexa Fluor 488 molecules were 
very stable, except for the occasional characteristic single-step bleaching events (Fig. [23, 
C). The fluorescent intensity of tdimer is more critical. Blinking and bleaching of tdimer 
might be confused with binding and unbinding of CI2 from DNA. To control for this 
possible problem, tdimer: Cys-88 CI molecules non-specifically bound to a glass surface 
were investigated. We could identify single tdimer: Cys-88 CI dimer molecules as individual 
bright spots with similar intensities. With time, the intensities of these spots decreased 
gradually (Fig. [2p to G). It is worth emphasizing that the molecules were only illuminated 
when the image was acquired during the process of time-lapse image acquisition. The 
accumulated exposure time during a 40- or 60-min movie was about 100 s. With tdimer 
bleaching slowly and gradually during this period of time, the binding and unbinding of 
CI2 can be easily resolved. 

In the dynamic experiments, we captured 40 or 60 min movies at each of the 6 patches 
in the same channel. Fig. [3]A. shows a typical time trace of the fluorescence signal from one 
molecule over a 40 min interval. The baseline (minimum signal intensity) corresponds to 
the state with no CI2 bound to DNA, while the plateaus at finite intensity represent the 
state with one CI2 bound to DNA. At very low [CI2], most DNA has not bound CI2 over 
the duration of data acquisition. However, at high [CI2] , several finite intensity states were 
observed in a single time-trace, resulting from multiple CI2 interacting with a single DNA 
molecule. We adjusted the [CI2] to optimize the number of single CI2 binding events. In 
each field of view, there are time traces for ~200 molecules on average, each displaying 1 - 
10 bright states. As explained above, the "filtered set" only contains the traces with single- 
step transitions. The histograms of the sets of tdark and tbright in Fig. [3P and D are well 
fitted by an exponential distribution, consistent with a Poisson processes and implying that 
association and dissociation are best modeled as single step processes. This then allows Aq 
and fii in each of the populations to be determined. 

In some data sets, similar dark-bright red intensity switching patterns were observed 
at locations where there was no green fluorescence. Some of these locations might have 
harbored a bleached DNA molecule, while others might simply have been CI molecules 
non-specifically bound to the surface. To make sure we were looking at the interaction of 
CI2 to DNA, we counted events only at locations where green fluorescence was high. 

CI binds DNA both with and without the operator sequence, 
showing different binding profiles 

We first discuss measurements of binding profiles for single DNA molecules by looking at 
still images. 

Recall that the red intensities have been normalized using the intensity of single CI2 
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molecules. A series of intensity histograms is shown in Fig. H] for CI2 interacting with op 
DNA. The histogram consists of two peaks, a narrow one centered around the background 
intensity (dashed line) identifies the population of DNA without CI2 bound to it. For DNA 
from 78 to 200 bp, the position of the second peak is consistent with one CI2 bound to DNA. 
These peaks overlap with the histogram of single Cys-88 CI dimer molecules, confirming 
that CI bind to DNA as a dimer. With further increase in L, the peak broadens and shifts, 
implying that two or more Cl2's bind to each DNA molecule as the CI2 concentration 
increases 

As mentioned above, the array design allows the binding probability to be investigated 
as both L and [CI2] are varied. A revealing way to view the trends is to display in the 
(L, J) plane the intensity histogram as a heat map, with blue indicating zero counts and red 
indicating large numbers of counts (Fig. E]). The 6 vertical strips within each panel display 
the normalized event-count distribution in the 6 patches (with successive increasing L) all 
exposed to the same [CI2]. The 4 upper panels are for nop DNA while the lower panels 
are for op DNA. With [CI2] held fixed, more CI2 was observed to bind to op DNA (lower 
panels), as expected from the increased affinity. Clearly, in each panel, the high intensity 
peak moves to larger values as L increases from 78 to 500 bp. However, in very dilute [CI2] 
(right-most panels), short (78 ~ 300 bp) nop DNA molecules mostly had no CI2 to bound 
them, while large fraction of short op DNA had one CI2 bound. It was very likely that this 
CI2 molecule bound to DNA at the OrI sequence. 

To quantify these trends, we calculated the mean number (n) of bound Cl2's per DNA 
molecule in each frame as the mean red intensity normalized by the intensity of single CI2 
molecules. Figure O summarizes how (n) varies with L and concentration in the following 
4 cases: WT CI binding to nop (Fig. [6]A) and op DNA (Fig. [6|3), and the tight-binder 
Lys-34 CI mutant binding to nop (Fig. [6p) and op DNA (Fig. [6p). In each panel, (n) is 
plotted against L for each of the three or four [CI2] (nominal values stated in Fig. [6]). The 
measurements were repeated 2-3 times to check reproducibility. For a given [CI2] , we found 
that (n) increased linearly with L, and may be fit by a straight line. With increasing of 
[CI2], the slope s of the linear fits increased, initially in proportion to, but tending towards 
saturation at high [CI2] (Fig. [6]A, C and D). 

In general, the L-linear increase in (n) reflects increased availability of non-operator 
sites for binding. Significantly, if we look closely at the runs with op DNA (Fig. 03 and D), 
we find that (n) displays a finite intercept no, viz. (n) = no + sL. At low [CI2], the strong 
dominance of binding to the OrI site over non-operator sites renders the association rate 
nearly insensitive to L. 

Dissociation and association rate constants 

We first compare the binding strengths of WT CI2 to op and nop DNA embedded in two 
length extremes, 78 and 500 bp. Fig. [7]A shows the measured dissociation rate constant 
kd- Values of for nop DNA (open symbols) were found to be larger than the op case 
(solid symbols) by a factor 2-4. As expected, WT CI2 binds more strongly to op DNA 
compared to nop. Within the uncertainty of our experiments, was not significantly 
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different between the 2 lengths, i.e. a 6-fold increase in L had no observable effect on how 
long the CI2 remained bound. Three runs with nominally similar [CI2] are shown. 

The corresponding association rate constant Aq = A;a[Cl2] showed the opposite relation- 
ship between op and nop DNA. The values for Aq in op DNA (solid symbols) were larger 
than nop (open symbols) by a factor of 2 - 4 (Fig. 03), consistent with the higher affinity 
for op DNA. Moreover, the measured Aq for op DNA showed no significant dependence on 
L. This is expected because binding of CI2 to the single OrI site in op DNA dominates 
the op binding processes, and is independent of L. By contrast, Aq should increase with L 
for nop DNA. We indeed observed that Aq in WT-l-nop (open symbols) increased roughly 
4-fold between the 78-bp and 500-bp populations. We revisit this point later. 

To bolster these conclusions, we extended measurements to several DNA populations 
of intermediate L (100, 200, 300 and 400 bp's). In addition, we investigated the Lys-34 CI 
mutant, which binds more tightly than WT CI (I27!). Fig. [8] summarizes the dissociation 
rates fed measured with WT CI + nop DNA (solid symbols), WT CI -|- op DNA (open 
symbols) and for the tight-binder Lys-34 CI + op DNA (asterisks). For each DNA con- 
struct, op or nop, 6 patches of different lengths were investigated. The measured k^s were 
well separated in the 3 populations. In each case, the grey band is centered at the mean 
value of /cd, while its half-width gives the standard deviation. As expected, Lys-34 CI + 
op DNA showed the strongest binding (smallest kd), WT CI + op DNA was intermediate, 
while WT CI + nop DNA had the weakest binding (~10 times weaker than Lys-34 CI -|- 
op DNA). In addition, the results strongly reinforce the conclusion that the k^ in each case 
is unchanged by a 6-fold increase in L. 

The experiments for the WT CI + nop DNA and WT CI + op DNA were run 2 and 
3 times, respectively, with different sample preparations. The runs are shown as different 
symbols. Within the error limits, the values of k^ were insensitive to changes in [CI2], 
consistent with the expectation that dissociation should not be influenced by [CI2]. This 
insensitivity to [CI2] allows the data for fed from different runs to be meaningfully compared, 
as in Fig. [HI However, Aq, which we discuss next, is obviously L dependent. 

Fig. [9] plots A;a[Cl2] = Aq obtained in these experiments. For nop DNA (Panel A), Aq 
displayed a clear increase with L, which confirms the trend hinted at in Fig. [7J3, and is 



consistent with the results of (1281 ). The nominally L-linear increase is consistent with the 
expectation that a longer DNA target has a larger probability of capturing CI2. However, 
this is valid only for nop DNA. With op DNA, Aq showed no resolvable L dependence for 
both WT CI (Fig. EB) and Lys-34 CI (Fig. EP). Apparently, the OrI binding site, when 
present, dominates any advantage that increasing L might confer on repressor capture at 
the operator site. In contrast with kd, Aq increases by a factor of 4-5 for a ~6-fold increase 
in [CI2] (Fig. [9]) as we expect higher concentration leads to more binding events. 

Control gel-shift assays 

The association rates measured using the single-molecule microscopy method are two orders 



of magnitude smaller than the values in the literature (1271 ). To test if this discrepancy is 



caused by the experimental method or the experimental materials, gel-shift assays were 
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performed using the same DNA and CI as well as similar buffer in the single-molecule 
experiments. The reaction of 20 nM DNA and three CI2 concentrations, 5 nM, 36.5 nM 
and 452 nM, were investigated. The results are shown in Fig. [TOl 

The nop DNA captured CI2, which was also observed in the single-molecule measure- 
ments. About half of the DNA molecules were bound to CI2 at the CI2 concentration of 
452 nM. The equilibrium constant Kj) for nop and CI2 is estimated to be about hundreds of 
nM, similar to the single-molecule measurement results (150 ± 50 nM for 78 bp nop DNA, 
64 ± 9 nM for 500 bp nop DNA). 

For op DNA and WT CI, about half of the DNA molecules were bound to CI2 at a 
lower CI2 concentration, 36.5 nM, in agreement with our finding that op DNA has a higher 
affinity for CI2. The equilibrium constant J^d is estimated from the gel-shift assays result 
to be about tens of nM, similar to our single molecule results of 22 ± 3 nM. 

Compared with WT CI, the Lys-34 mutant of CI has higher affinity for op DNA. When 
CI2 concentration is larger than that of DNA, almost all the DNA molecules were bound. 
Only at the lowest CI2 concentration, 5 nM, did we observe some free DNA. We estimate 
that the dissociation constant Kd to be much smaller than 20 nM. Our single-molecule 
measurement suggest that it is 0.11 ± 0.02 nM. 



Discussion and conclusions 
Blinking and bleaching of the tdimer 

To investigate the behavior of tdimer: CI, we immobilized Cys-88 CI on glass surfaces with- 
out a PEG coating by non-specific binding. Recall that Cys-88 CI forms covalent dimers, 
which is the same as the smallest active units that binds DNA at the molarities explored 
here. When the protein concentration was low enough, we observed distinct bright spots 
with similar intensities in the red channel. With time, the intensities of these spots decrease 
gradually. 

It is known that green fluorescent protein exhibits "on-off" blinking behavior on the 
time scale of seconds (j29l ). For GFP blinking, the fractional on time is defined as = 
(Aton)/((Aton) + (Atoff)), where Aton (Atofr) is the intervals when the fiuorescence is on(off). 
Fon value depends on excitation intensity /ex such that Fon = h/ + /ex)- The saturation 



excitation intensity Is equals 1.5 kW/cm^ for S65T-GFP (1301 ). In our experiments, the 
excitation intensity is as low as 19 W/cm^. If the monomers of tdimer have similar /g, F^^ 
would be nearly 1, which means that the monomers would be in the on state most of the 
time. If the mean off time (Atofr) is on the order of seconds, like S65T-GFP, the mean on 
time (Aton) would be as long as the total exposure time in our experiment runs. 

Further studies with the red fluorescent protein DsRed showed that this closely packed 
tetramer protein has four distinctive intensity levels for each single molecule. Each intensity 



level has a life time of seconds (1311 ). However, little is known about the tdimer, which is a 
tandem dimer of two RFP monomers. Each CI2 has two independent tdimers. One would 
therefore expect that (tdimer:CI)2 would bleach in at lease two steps and possibly up to 
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four steps. However, the intensity change in each step would be small comparing with 
photon number fluctuations in our experiments. In addition, the DNA-CI2 complex has 
considerable rotation freedom in solution, which causes intensity fluctuation with linear 
polarized excitation illumination used here. For these reasons, then, we believe that the 
intensity loss of the (tdimer:CI)2 fusion due to bleaching appears as a smooth gradual 
process. Hence it is reasonable to identify the rapid and step-wise changes in intensity with 
the unbinding of CI2 from DNA. 



Single molecule vs. ensemble measurements 

Several bulk assays have been used to investigate DNA-protein interactions. The nitro- 
cellulose filter binding assay (jsl) is based on the fact that protein binds to nitrocellulose, 
trapping any DNA bound to it. In the DNase footprinting assay (jol) proteins protect 
DNA sequences from enzymatic cleavage when the proteins are specifically bound to the 



sequences. In gel-shift assays ( llOl ) protein-bound DNA migrates more slowly during elec- 
trophoresis. Calorimetry (p) is based on heat generation/absorption when proteins bind to 
DNA. Surface plasmon resonance (SPR) fjs^ has also been used to investigate the inter- 
action between molecules. 

Some of these bulk assays can provide binding profiles, while others provide binding dy- 
namics. All but the calorimetry experiments are nonequilibrium measurements. Compared 
with bulk assays, an advantage of SMFM is that it can provide both the binding profile 
and the binding dynamics in great detail. When analyzing binding dynamics, we can filter 
out multiple molecule interaction, leaving only the one-on-one DNA-protein interactions. 
We can also study association and dissociation in isolation with the reaction system in 
equilibrium. 



The observed rate constants vs. values in the literature 

The association rate constant we measured are considerably smaller than those measured 
with bulk assays (1271. ISSi). Could this be because of our relatively slow sampling rate, since 
the events that occur faster than the time interval between two consecutive frames would 
not be detected? First, we note that most of the binding events last for many sampling 
frames. It is not easy to see how we could miss fast on and off rates at the same site. 
Secondly, we were careful to control for this possibility by increasing our sampling rate in 
preliminary experiments, where we did not observe any significant changes in the observed 
rate constants. Increasing sampling rate caused the tdimer to bleach faster, resulting only 
in fewer events being recorded. 

A second possibility for the discrepancy between our rates and values in the literature 
may be that the fusion of the bulky tdimer prevents CI from dimerization or binding to 
DNA. CI molecules bind to DNA via the N terminus, and dimerize through the C terminus. 
Without the C terminus, the N-terminal domain alone is known to have much lower affinity 



{Kd ~ 1 fiM) than the intact CI protein (1341 ). We reason that, because the tdimer was 



fused to the N terminus, it should not affect CI dimerization. Moreover, according to 
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the crystal structure (l35l ). the N-terminus points away from DNA, so the tdimer should 
not affect the DNA binding either. As noted in Materials and Methods, these constructs 
confer immunity to superinfection. They have also been used to study oligomerization 



by fluorescence correlation spectroscopy (FCS) fl36l ). We compared the intensity of CI 
bound to DNA with the intensity of the Cys-88 covalent CI dimer. The Cys-88 mutant 
CI is confirmed to form dimer by comparing the molecular mass in both reducing and 
non-reducing SDS gels. As shown in Fig. IH the Cys-88 covalent CI dimer and tdimer:CI 
bound on short DNA molecules have similar fluorescence levels. In summary, the evidence 
suggests that these fusion products function as dimers, both in vivo and in vitro, and at 
concentrations in the nM range. 

Yet another possibility is that the glass surface may somehow severely restrict accessi- 
bility of CI to the DNA molecules, despite the long distance (40 bp, ~ 14 nm) between the 
operator sequence and the immobilization point. 

To address these questions, we conducted control gel-shift assays with the same DNA 
and CI molecules reacting in buffer solutions similar to that used in the single-molecule 
experiment. The results show that the affinity of the tdimer: CI is not as high as reported 
in the literature {e.g. ~ (2.9 ± 0.9) x 10"^^ M for op DNA and WT CI2 at 50 mM 



KCl) (1331 ). but rather similar to our single-molecule results [Kd = (2.2 ± 0.3) x 10~^ M for 
op DNA and WT CI2). However, the affinity difference between CI binding to op and nop 
DNA, as well as between WT and the tight binding Lys-34 mutant was confirmed in these 
fusion proteins. 

Binding dynamics suggested very short ID search lengths 

It is widely believed that transcription factors find their targets by first associating at non- 
operator sequence, and then diffusing along DNA using sliding and hopping to find their 
targets. Since hops are assumed to cover very short distances, they can be combined with 



sliding by scaling up the sliding length to the length of a ID search Se (1371 ). The interaction 
of CI2 with DNA can then be summarized with a 1D-3D model: CI2 spend its time either 
by diffusing 3D in the space or by sliding ID along the DNA. When CI2 jumps on(off ) the 
DNA molecule we will see abrupt red intensity increase(decrease) at the location of DNA. 
The dynamics should allow effective ID searching lengths of 10^ bp to be distinguished 
from those ~20 bp by comparing 78 to 500 bp DNA molecules. 

We consider first k^, for nop DNA. We expect it to scale with the length of DNA since 
doubling the DNA lengths doubles the cross section for capturing a CI2. A linear fit to our 
data shows that /ca increases linearly with L, but with an intercept /ca ~ + 200 bp. The 
reason for a non-zero intercept could be that immobilizing the DNA on a surface makes 
the two ends of DNA nonequivalent for CI2 capture. If so, the positive value suggests that 
the immobilized end captures CI2 more often than the free end. 

For op DNA, one would expect to measure a length-dependent k^, if CI2 associates with 
any sequence and then finds its target through ID diffusion. Instead, we observed that 
fca in op DNA exhibits little length dependence. This implies that CI2 associates with 
the operator sequence with much higher probability than with a non-operator sequence. 
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which is expected. It also imphes that the effective ID search length is shorter than 78 bp, 
the shortest DNA length in our experiments. Even so, one might still expect some length 
dependence since the Cl2's first interaction with DNA might be non-specific. However, the 
operator sequence is closer to the immobilized end and is always at the same distance from 
the immobilization point. The result with nop DNA suggests that the immobilized end 
and the free end behave differently. Because the immobilized end captures more CI2, we 
expect to see more CI2 binding to the operator sequence than CI2 binding with non-specific 
sequence farther from the immobilizing point. Thus the length dependence of fca associated 
with non-specific interaction may be weakened. 

As for fed, we observed clearly that CI2 bound to op DNA dissociates more slowly than 
CI2 bound to nop DNA. This confirms that the operator sequence has a higher affinity. 
Secondly, we see that Lys-34 CI has a smaller than WT CI when binding to op DNA. 
This confirms that the interaction is protein specific. As shown in Fig. [HI when affinity 
increases, fed decreases (1 x 10~^ s~^ for WT CI -|- nop DNA compared with 5 x 10~^ s~^ 
for WT CI + op DNA), by a factor of 2. For the tight-binder Lys-34 mutant CI + op, 
further decreases to 1.6 x 10"^ s~^ by a factor of 3. 

Apart from dissociation from locations along the length of the DNA, CI2 may also 
dissociate at one of the ends of the DNA molecules. However, immobilization of one end via 
biotin-streptavidin link the the surface likely prevents dissociation from that end. Instead, 
reflection is possible, and the CI2 returns to ID diffusion. For nop DNA, we observed no 
length dependence of k^. It is possible that the free DNA end is totally reflective, or ID 
diffusion covers only a small range of DNA, so that CI2 dissociates from a local sequence. 
For the first case, we would expect to see some k^ length dependence for CI2 and op DNA, 
since CI2 on shorter DNA encounters more reflections. In the second case (ID diffusion 
covers a very short distance), there will be no length dependence for op DNA either, since 
CI2 falls off the operator sequence before it can interact with the non-operator sequences. 
Our observation supports the second case. We also observed little length dependence of 
the tight-binder mutant Lys-34 CI2 to op DNA. 

Finally, as noted above, if k^ involves ID diffusion-and the evidence summarized here 
suggests it does not-then L-dependence should also appear in k^ measured as a function 
of L, since for simple Poisson events (Fig. [3p,D) the law of microscopic reversibility holds. 
That kd shows no such dependence for op sequences (Fig. [71A., [S]) strengthens our view that 
ID diffusion plays little if any role in target finding over a range of 78 to 500 bp. 

Extensive ID diffusion on non-specific DNA can slow the search to a rate that is even 
slower than 3D searching alone. However, in bacteria, target sequences for most repressors, 
including CI, lie very close to their coding sequences (1371 ). Moreover, the motion of single 
messenger RNA molecules in living cells was recently reported to follow a subdiffusion 
process (1381 ). This implies that the local concentration of repressor is higher - and the 
search volume smaller - than is often assumed. While k^. is seen to increase with L for 
binding to nop DNA (Fig.[9]A), we could not resolve any L dependence for op DNA (Fig.[9l3 
and C). This suggests to us that, although nonspecific binding of CI scaling with L is readily 
seen, ID diffusion is unimportant once an operator is present, which is more consistent with 
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this scenario. 

We note that proteins that are to bind and diffuse on DNA are often designed to clamp 



onto the DNA fl39l ) as they shde along. For those transcription factors with multiple distant 
targets, they often have high molarities in the cell. In the case of CI, once it is bound to 
the OrI and Or2, which lie adjacent to the cl coding region, cl expression is up-regulated, 
resulting in increasing concentrations of CI9. . The expression is not repressed until the low 
affinity binding site Or3 is also bound (H. l4ol). Increasing concentration then increases the 
probability that CI2 will binding to Ol sequences 2.4 kbp away, as well as the operator sites 
on superinfecting A DNA. Recent results show that CI transcription is further regulated by 
DNA looping induced by CI2 binding at OlI, Ol2, OrI and Or2 (0). DNA looping pulls 
the far binding site Ol also to the vicinity of cl coding region, increasing the probability 
of CI2 finding the two lower affinity sites Or3 and Ol3. 

In summary, we have used quantitative single molecule fluorescence microscopy to in- 
vestigate the interaction of CI2 with DNA molecules. We show that CI2 can bind DNA at 
both operator and non-operator sequences. However, Cl2's interaction with the operator 
sequence is not affected by the length of flanking non-operator sequences. Our data sug- 
gested that CI2 finds its target operator sequence mainly by 3D diffusion. 
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Table 1: Right primers used for each DNA sample 



DNA 


Primer seniienre 




5'-CCGAATTCCTCGAGTCTAGAGG-3' 


Tinn 1 on Y)Ti 


5'-TATCGATTTCGAACCCGGGG-3' 


nop 200 bp 


5'-GGAAACAGCTATGACCATGATTAC G-3' 


nop 300 bp 


5'-TAATGTGAGTTAGCTCACTCATTAGGC-3' 


nop 400 bp 


5'-CAATACGCAAACCGCCTCTC-3' 


nop 500 bp 


5'-GTGGATAACCGTATTACCGCCT-3' 


op 78 bp 


5'-CATGCGACGTCGGGC-3' 


op 100 bp 


5'-GAATTCCTCGAGTCTAGAGGAGC-3' 


op 200 bp 


5'-GCCAAGCTATTTAGGTGACACTATAGAA-3' 


op 300 bp 


5'-GGCACCCCAGGCTTTAC-3' 


op 400 bp 


5'-CGCGTTGGCCGATTCAT-3' 


op 500 bp 


5'-GAGTGAGCTGATACCGCTCG-5' 
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Figure Legends 

Figure [1], 

Sample preparation and single molecule imaging of CI-DNA interaction. (A) Schematic 
description of the PGR primers used here. The OrI sequence was inserted into a vector. 
A common PGR left primer with a 5' biotin and an internal Alexa Fluor 488 was used 
in each PGR reaction. The right primers were selected so that different primers produce 
different DNA lengths, nop DNA samples were synthesized using the vector lacking the 
OrI sequence as the PGR template. (B) Layout of 24 DNA patches in 4 channels defined 
by vacuum grease (yellow stripes) between two rectangular coverslips. Each channel had 
a different [GI2] (shown as different red opacities) and allowing us to compare different L 
DNA at the same GI concentration. (G) Schematic drawing of the reaction in one patch. 
Green fiuorescently labeled DNA molecules were immobilized on the coverslip surface. In 
solution, and also bound on DNA were (tdimer:GI)2 molecules (GI binds as a dimer). 
Both molecules were visualized by TIRF microscopy. (D) Superposition of red and green 
fluorescence images. Green spots are individual DNA molecules fixed to the coverslip 
surface. Red spots are tdimer:GI molecules. Golocalized DNA and GI, the yellow spots, 
indicate GI-DNA binding. 

Figure [2], 

Molecular bleaching. (A) Average bleaching curve of all molecules in a field of view as a 
function of time. The Alexa Fluor 488 fluorescence decreased 10% and the tdimer fluores- 
cence decreased 30% after 100 s of continuous illumination. (B, G) Intensity-time traces 
of two typical Alexa Fluor 488 molecules. The intensity dropped to the background level 
abruptly. (D - G) Intensity-time traces of four typical tdimer: Gys-88 GI molecules. Gom- 
pared with single Alexa Fluor 488 molecules, there was no abrupt large drop of intensity. 
With time, the intensities of these molecules decreased gradually. 

Figure [Sl 

Dynamics of GI2 interacting with DNA (WT GI and op 300 bp DNA are shown here). (A) 
Time trace of red fluorescent signal / vs. t from a single op DNA molecule displaying the 
transitions DPq ^ DPi. Bright intensities correspond to binding of a single (tdimer:GI)2. 
The bold line-segments are generated by the transition-recognition program that counts 
the events. (B) A single DNA molecule could be bound by more than one GI2. During 
5-9 min, this DNA was bound by two Gl2's. We filter out multiple binding events and 
accumulate statistics of dark intervals tdark and the bright intervals tbright using traces as 
in (A). The histograms of the distribution of tdark and tbright are shown in Panels G and D, 
respectively. Fits to exponential distributions 'P(tdark) and P(tbright) (bold curves) assuming 
Poisson processes yields Aq = [Gl2]fca and /ii = k^- (Also see main text) 
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Figure [H, 

Histograms of red fluorescence intensities from (tdimer:CI)2 (Lys-34 CI in this flgure) per 
green fluorescent spot (op DNA in this flgure), indicating the number of Cl2's per DNA. 
DNA lengths L are labeled in each panel. These DNA molecules were exposed to the same 
[CI2] . Two peaks are apparent in each panel. The narrow peak near zero intensity (aligned 
with the dashed line) is the DNA population with no bound CI2. The second peak is 
the DNA population bound with one molecule of CI2 {n > 1). As L increases, the peak 
broadens and shifts to higher intensity, reflecting the increase in number of Cl2's per DNA. 
A histogram of tdimer:Cys-88 CI intensities is plotted as a thick line in each panel as the 
dimer standard. 

Figure [H 

Evolution of the fluorescence intensities of Lys-34 CI bound to DNA versus DNA length 
L and [CI2]. Normalized frequencies of fluorescence in each intensity bin are shown as a 
heat map. The DNA is nop (op) in the upper (lower) panels. Each panel shows how the 
histograms change as L steps through the values 78, 100, 200, 300, 400, 500 bp, with [CI2] 
flxed at the value shown on the top. Blue represent zero frequency, while red represent 
high frequency. In each panel, as L increases, the center of the second peak shifts to larger 
CI2 number as the peak also gets broader, reflecting in increasing (n). Increasing [CI2] at 
flxed L also increases (n). At the same L and [CI2], op DNA captures more CI2 than nop 
DNA. 

Figure [61. 

The mean number (n) of Cl2's per DNA molecule plotted vs. L. (A) WT CI and nop DNA, 
(B) WT CI and op DNA, (C) Lys-34 CI and nop DNA, (D) Lys-34 CI and op DNA. In 
each panel, (n) is calculated as the mean red fluorescent intensity per DNA site, normalized 
by the intensity of single CI2. Straight lines are best linear flts to (n). Within each panel, 
[CI2] decreases ~3-fold between data sets with successively smaller slopes. The values (in 
nM) are 25, 11, 4.4, 1.7 (Panel A); 4.4, 1.7, 0.6 (B); 2.0, 0.71, 0.23, 0.07 (C); and 2.0, 0.71, 
0.23, 0.07 (D). 

Figure [T]. 

Direct comparisons of rate constants for binding of WT CI2 to nop versus op DNA. Panel 
A compares the dissociation rate constants kd for op DNA (solid symbols) with that for 
nop DNA (open symbols). For each DNA population, 2 lengths were investigated in 3 runs 
at the same nominal [CI2] of 11 nM (different symbols). Panel B shows the association rate 
constants extracted from the same experiments. 



Single molecule CI-DNA interaction 



23 



Figure [S 

Comparison of tlie dissociation rate constants measured for binding of WT CI to nop 
DNA (solid symbols) and op DNA (open symbols), and binding of the Lys-34 CI to op DNA 
(asterisks). Six DNA lengths were investigated (at fixed [CI2]) in each case. For WT CI + 
nop DNA, [CI2] was 11 nM in both runs (solid triangles). For WT CI + op DNA, [CI2] was 
11 nM in 2 runs (open squares and circles) and 1.7 nM in a third (open diamonds). In Lys- 
34 CI + op DNA, [CI2] was 0.07 nM. In each case, a grey band is centered at the mean value 
of kd while its half-width gives the variance. The observed values of kd increase significantly 
as we go from Lys-34+op to WT+op, to WT+nop (successively weaker binding of CI2). 
However, k^ does not display a resolvable L or [CI2] dependence. 

Figure [91 

The corresponding values of k^ extracted from the series of experiments involving the CI- 
DNA populations A: WT CI + nop DNA. The two different symbols are for two different 
experimental runs. B: WT CI + op DNA, and C: Lys-34 CI + op DNA shown in Fig. [HI 
Panel A shows that /ca in nop DNA increases linearly with L, but in the 2 op DNA pop- 
ulations, ka is virtually L independent. In Panel A, the nominal [CI2] ~ 11 nM for both 
open and solid triangles. In B, [CI2] is 11 nM (open square), and 1.7 nM (solid circles). In 
C, [(Lys-34 CI)2] is 0.07 nM. 

Figure [IS 

Gel-shift assays of DNA-CI reactions. Upper panel: 78 bp DNA. Lower panel, 500 bp 
DNA. DNA concentration in each lane were 20 nM. Lane 1: nop DNA control with no CI2. 
Lane 2 to 4, nop DNA interacting with increasing concentrations of WT CI2. Lane 5 to 
7, op DNA interacting with increasing concentrations of WT CI2. Lane 8 to 10, op DNA 
interacting with increasing concentrations of Lys-34 CI2. The three CI2 concentrations are 
5 nM (Lane 2, 5, 8), 36.5 nM (Lane 3, 6, 9) and 452 nM (Lane 4, 7, 10). 
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